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1 SEQUENCE LISTING 
2 

3 (1) General Information: 
4 

5 (i) APPLICANT: Cech, Thomas R. 

6 Lingner, Joachim 

7 Nakamura , Toru 

8 Chapman, Karen B. 

9 Morin, Gregg B. 

10 Harley, Calvin 

11 Andrews, William H. 
12 

13 (ii) TITLE OF INVENTION: Novel Telomerase 
14 

15 (iii) NUMBER OF SEQUENCES: 171 
16 

17 (iv) CORRESPONDENCE ADDRESS: 

18 (A) ADDRESS: Town send and Townsend and Crew LLP 

19 (B) STREET: Two Embarcadero Center, 8th Floor 

20 (C) CITY: San Francisco 

21 (D) STATE: California 

22 <E) COUNTRY: United States of America 

23 (F) ZIP: 94111 
24 

25 (V) COMPUTER READABLE FORM: 

26 (A) MEDIUM TYPE: Floppy disk 

27 (B) COMPUTER: IBM PC compatible 

28 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

29 (D) SOFTWARE: Patentln Release #1.0, Version #1.30 
30 

31 (vi) CURRENT APPLICATION DATA: 

32 (A) APPLICATION NUMBER: US 08/846,017 

33 (B) FILING DATE: 25-APR-1997 

34 (C) CLASSIFICATION: 
35 

36 (vii) PRIOR APPLICATION DATA: 

37 (A) APPLICATION NUMBER: US 08/844,419 

38 (B) FILING DATE: 18-APR-1997 

39 (C) CLASSIFICATION: 
40 

41 (vii) PRIOR APPLICATION DATA: 

42 (A) APPLICATION NUMBER: US 08/724,643 

43 (B) FILING DATE: 01-OCT-1996 

44 (C) CLASSIFICATION: 
45 

46 (viii) ATTORNEY/ AGENT INFORMATION: 
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47 (A) NAME: Apple, Randolph T\ 

48 (B) REGISTRATION NUMBER: 36,429 

49 (C) REFERENCE/DOCKET NUMBER: 015389-002920US 
50 

51 <ix) TELECOMMUNICATION INFORMATION: 

52 (A) TELEPHONE: (415) 576-0200 

53 (B) TELEFAX: (415) 576-0300 
54 

55 (2) INFORMATION FOR SEQ ID NO:l: 
56 

57 (i) SEQUENCE CHARACTERISTICS: 

58 (A) LENGTH: 3279 base pairs 

59 (B) TYPE: nucleic acid 

60 (C) STRANDEDNESS: single 

61 (D) TOPOLOGY: linear 
62 

6 3 (ii) MOLECULE TYPE: other nucleic acid 

64 (A) DESCRIPTION: /desc = " DNA" 

65 

66 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

67 

68 AAAACCCCAA AACCCCAAAA CCCCTTTTAG AGCCCTGCAG TTGGAAATAT AACCTCAGTA 60 
69 

70 TTAATAAGCT CAGATTTTAA ATATTAATTA CAAAACCTAA ATGGAGGTTG ATGTTGATAA 120 
71 

72 TCAAGCTGAT AATCATGGCA TTCACTCAGC TCTTAAGACT TGTGAAGAAA TTAAAGAAGC 180 
73 

74 TAAAACGTTG TACTCTTGGA TCCAGAAAGT TATTAGATGA AGAAATCAAT CTCAAAGTCA 240 
75 

76 TTATAAAGAT TTAGAAGATA TTAAAATATT TGCGCAGACA AATATTGTTG CTACTCCACG 300 
77 

78 AGACTATAAT GAAGAAGATT TTAAAGTTAT TGCAAGAAAA GAAGTATTTT CAACTGGACT 360 
79 

80 AATGATCGAA CTTATTGACA AATGCTTAGT TGAACTTCTT TCATCAAGCG ATGTTTCAGA 420 
81 

82 TAGACAAAAA CTTCAATGAT TTGGATTTCA ACTTAAGGGA AATCAATTAG CAAAGACCCA 480 
83 

84 TTTATTAACA GCTCTTTCAA CTCAAAAGCA GTATTTCTTT CAAGACGAAT GGAACCAAGT 540 
85 

86 TAGAGCAATG ATTGGAAATG AGCTCTTCCG ACATCTCTAC ACTAAATATT TAATATTCCA 600 
87 

88 GCGAACTTCT GAAGGAACTC TTGTTCAATT TTGCGGGAAT AACGTTTTTG ATCATTTGAA 660 
89 

90 AGTCAACGAT AAGTTTGACA AAAAGCAAAA AGGTGGAGCA GCAGACATGA ATGAACCTCG 720 
91 

92 ATGTTGATCA ACCTGCAAAT ACAATGTCAA GAATGAGAAA GATCACTTTC TCAACAACAT 780 
93 

94 CAACGTGCCG AATTGGAATA ATATGAAATC AAGAACCAGA ATATTTTATT GCACTCATTT 840 
95 

96 TAATAGAAAT AACCAATTCT TCAAAAAGCA TGAGTTTGTG AGTAACAAAA ACAATATTTC 900 
97 

98 AGCGATGGAC AGAGCTCAGA CGATATTCAC GAATATATTC AGATTTAATA GAATTAGAAA 960 
99 
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100 GAAGCTAAAA GATAAGGTTA TCGAAAAAAT TGCCTACATG CTTGAGAAAG TCAAAGATTT 1020 
101 

102 TAACTTCAAC TACTATTTAA CAAAATCTTG TCCTCTTCCA GAAAATTGGC GGGAACGGAA 1080 
103 

104 ACAAAAAATC GAAAACTTGA TAAATAAAAC TAGAGAAGAA AAGTCGAAGT ACTATGAAGA 1140 
105 

106 GCTGTTTAGC TACACAACTG ATAATAAATG CGTCACACAA TTTATTAATG AATTTTTCTA 1200 
107 

108 CAATATACTC CCCAAAGACT TTTTGACTGG AAGAAACCGT AAGAATTTTC AAAAGAAAGT 1260 
109 

110 TAAGAAATAT GTGGAACTAA ACAAGCATGA ACTCATTCAC AAAAACTTAT TGCTTGAGAA 1320 
111 

112 GATCAATACA AGAGAAATAT CATGGATGCA GGTTGAGACC TCTGCAAAGC ATTTTTATTA 1380 
113 

114 TTTTGATCAC GAAAACATCT ACGTCTTATG GAAATTGCTC CGATGGATAT TCGAGGATCT 1440 
115 

116 CGTCGTCTCG CTGATTAGAT GATTTTTCTA TGTCACCGAG CAACAGAAAA GTTACTCCAA 1500 
117 

118 AACCTATTAC TACAGAAAGA ATATTTGGGA CGTCATTATG AAAATGTCAA TCGCAGACTT 1560 
. 119 

120 AAAGAAGGAA ACGCTTGCTG AGGTCCAAGA AAAAGAGGTT GAAGAATGGA AAAAGTCGCT 1620 
121 

122 TGGATTTGCA CCTGGAAAAC TCAGACTAAT ACCGAAGAAA ACTACTTTCC GTCCAATTAT 1680 
123 

124 GACTTTCAAT AAGAAGATTG TAAATTCAGA CCGGAAGACT ACAAAATTAA CTACAAATAC 1740 
125 

126 GAAGTTATTG AACTCTCACT TAATGCTTAA GACATTGAAG AATAGAATGT TTAAAGATCC 1800 
127 

128 TTTTGGATTC GCTGTTTTTA ACTATGATGA TGTAATGAAA AAGTATGAGG AGTTTGTTTG 1860 
129 

130 CAAATGGAAG CAAGTTGGAC AACCAAAACT CTTCTTTGCA ACTATGGATA TCGAAAAGTG 1920 
131 

132 ATATGATAGT GTAAACAGAG AAAAACTATC AACATTCCTA AAAACTACTA AATTACTTTC 1980 
133 

134 TTCAGATTTC TGGATTATGA CTGCACAAAT TCTAAAGAGA AAGAATAACA TAGTTATCGA 2040 
135 

136 TTCGAAAAAC TTTAGAAAGA AAGAAATGAA AGATTATTTT AGACAGAAAT TCCAGAAGAT 2100 
137 

138 TGCACTTGAA GGAGGACAAT ATCCAACCTT ATTCAGTGTT CTTGAAAATG AACAAAATGA 2160 
139 

140 CTTAAATGCA AAGAAAACAT TAATTGTTGA AGCAAAGCAA AGAAATTATT TTAAGAAAGA 2220 
141 

142 TAACTTACTT CAACCAGTCA TTAATATTTG CCAATATAAT TACATTAACT TTAATGGGAA 2280 
143 

144 GTTTTATAAA CAAACAAAAG GAATTCCTCA AGGTCTTTGA GTTTCATCAA TTTTGTCATC 2 340 

145 

14 6 ATTTTATTAT GCAACATTAG AGGAAAGCTC CTTAGGATTC CTTAGAGATG AATCAATGAA 2400 
147 

14 8 CCCTGAAAAT CCAAATGTTA ATCTTCTAAT GAGACTTACA GATGACTATC TTTTGATTAC 2460 
149 

150 AACTCAAGAG AATAATGCAG TATTGTTTAT TGAGAAACTT ATAAACGTAA GTCGTGAAAA 2520 
151 

15 2 TGGATTTAAA TTCAATATGA AGAAACTACA GACTAGTTTT CCATTAAGTC CAAGCAAATT 2580 
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153 

154 TGCAAAATAC GGAATGGATA GTGTTGAGGA GCAAAATATT GTTCAAGATT ACTGCGATTG 2640 
155 

156 GATTGGCATC TCAATTGATA TGAAAACTCT TGCTTTAATG CCAAATATTA ACTTGAGAAT 2700 
157 

158 AGAAGGAATT CTGTGTACAC TCAATCTAAA CATGCAAACA AAGAAAGCAT CAATGTGGCT 2760 
159 

160 CAAGAAGAAA CTAAAGTCGT TTTTAATGAA TAACATTACC CATTATTTTA GAAAGACGAT 2820 
161 

162 TACAACCGAA GACTTTGCGA ATAAAACTCT CAACAAGTTA TTTATATCAG GCGGTTACAA 2880 
163 

164 ATACATGCAA TGAGCCAAAG AATACAAGGA CCACTTTAAG AAGAACTTAG CTATGAGCAG 2 940 
165 

166 TATGATCGAC TTAGAGGTAT CTAAAATTAT ATACTCTGTA ACCAGAGCAT TCTTTAAATA 3000 
167 

168 CCTTGTGTGC AATATTAAGG ATACAATTTT TG GAG AG GAG CATTATCCAG ACTTTTTCCT 3060 
169 

170 TAGCACACTG AAGCACTTTA TTGAAATATT CAGCACAAAA AAGTACATTT TCAACAGAGT 3120 
171 

172 TTGCATGATC CTCAAGGCAA AAGAAGCAAA GCTAAAAAGT GACCAATGTC AATCTCTAAT 3180 
173 

174 TCAATATGAT GCATAGTCGA CTATTCTAAC TTATTTTGGA AAGTTAATTT TCAATTTTTG 3240 
175 

176 TCTTATATAC TGGGGTTTTG GGGTTTTGGG GTTTTGGGG 327 9 

177 

178 (2) INFORMATION FOR SEQ ID NO: 2: 
179 

180 (i) SEQUENCE CHARACTERISTICS: 

181 (A) LENGTH: 1031 amino acids 

182 (B) TYPE: amino acid 

183 (C) STRANDEDNESS : Not Relevant 

184 (D) TOPOLOGY: Not Relevant 
185 

186 (ii) MOLECULE TYPE: protein 

187 

188 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

189 

190 Met Glu Val Asp Val Asp Asn Gin Ala Asp Asn His Gly lie His Ser 

191 15 10 15 
192 

193 Ala Leu Lys Thr Cys Glu Glu lie Lys Glu Ala Lys Thr Leu Tyr Ser 

194 20 25 30 
195 

196 Trp lie Gin Lys Val lie Arg Cys Arg Asn Gin Ser Gin Ser His Tyr 

197 35 40 45 
198 

199 Lys Asp Leu Glu Asp lie Lys lie Phe Ala Gin Thr Asn lie Val Ala 

200 50 55 60 
201 

202 Thr Pro Arg Asp Tyr Asn Glu Glu Asp Phe Lys Val lie Ala Arg Lys 

203 65 70 75 80 
204 

205 Glu Val Phe Ser Thr Gly Leu Met lie Glu Leu lie Asp Lys Cys Leu 
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206 85 90 95 

207 

208 Val Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin 

209 100 105 110 
210 

211 Cys Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys Thr His Leu 

212 115 120 125 
213 

214 Leu Thr Ala Leu Ser Thr Gin Lys Gin Tyr Phe Phe Gin Asp Glu Trp 

215 130 135 140 
216 

217 Asn Gin Val Arg Ala Met lie Gly Asn Glu Leu Phe Arg His Leu Tyr 

218 145 150 155 160 
219 

220 Thr Lys Tyr Leu lie Phe Gin Arg Thr Ser Glu Gly Thr Leu Val Gin 

221 165 170 175 
222 

223 Phe Cys Gly Asn Asn Val Phe Asp His Leu Lys Val Asn Asp Lys Phe 

224 180 185 190 
225 

226 Asp Lys Lys Gin Lys Gly Gly Ala Ala Asp Met Asn Glu Pro Arg Cys 

227 195 200 205 
228 

229 Cys Ser Thr Cys Lys Tyr Asn Val Lys Asn Glu Lys Asp His Phe Leu 

230 210 215 220 
231 

232 Asn Asn lie Asn Val Pro Asn Trp Asn Asn Met Lys Ser Arg Thr Arg 

233 225 230 235 240 
234 

235 lie Phe Tyr Cys Thr His Phe Asn Arg Asn Asn Gin Phe Phe Lys Lys 

236 245 250 255 
237 

238 His Glu Phe Val Ser Asn Lys Asn Asn lie Ser Ala Met Asp Arg Ala 

239 260 265 270 
240 

241 Gin Thr lie Phe Thr Asn lie Phe Arg Phe Asn Arg lie Arg Lys Lys 

242 275 280 285 
243 

244 Leu Lys Asp Lys Val lie Glu Lys lie Ala Tyr Met Leu Glu Lys Val 

245 290 295 300 
246 

247 Lys Asp Phe Asn Phe Asn Tyr Tyr Leu Thr Lys Ser Cys Pro Leu Pro 

248 305 310 315 320 
249 

250 Glu Asn Trp Arg Glu Arg Lys Gin Lys lie Glu Asn Leu lie Asn Lys 

251 325 330 335 
252 

25 3 Thr Arg Glu Glu Lys Ser Lys Tyr Tyr Glu Glu Leu Phe Ser Tyr Thr 

254 340 345 350 

255 

256 Thr Asp Asn Lys Cys Val Thr Gin Phe lie Asn Glu Phe Phe Tyr Asn 

257 355 360 365 
258 
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Application No.: 

NOTICE TO COMPLY WITH REQUIREMENTS FOR PATENT APPLICATIONS CONTAINING 
NUCLEOTIDE SEQUENCE AND/OR AMINO ACID SEQUENCE DISCLOSURES 

The nucleotide and/or amino acid sequence disclosure contained in this application does not 
comply with the requirements for such a disclosure as set forth in 37 C.F.R. 1.821 - 1.825 for 
the following reason(s): 

rn 1. This application clearly fails to comply with the requirements of 37 C.F.R. 1.821-1.825. Applicant's 
Mi attention is directed to these regulations, published at 1114 OG 29, May 15, 1990 and at 55 FR 
I 18230, May 1, 1990. 

□ 2. This application does not contain, as a separate part of the disclosure on paper copy, a "Sequence 
Listing" as required by 37 C.F.R. 1.821(c). 

□ 3. A copy of the "Sequence Listing" in computer readable form has not been submitted as required by 
37 C.F.R. 1.821(e). 



□ 



4. A copy of the "Sequence Listing" in-computer readable form has been submitted. However, the 
content of the computer readable form does not comply with the requirements of 37 C.F.R. 1.822 
and/or 1.823, as indicated on the attached copy of the marked -up "Raw Sequence Listing." 



□ 5. The computer readable form that has been filed with this application has been found to be damaged 
and/or unreadable as indicated on the attached CRF Diskette Problem Report. A Substitute 
computer readable form must be submitted as required by 37 C.F.R. 1.825(d). 

□ 6. The paper copy of the "Sequence Listing" is not the same as the computer readable from of the 
"Sequence Listing" as required by 37 C.F.R. 1.821(e). 

n 



7. Other: 



Applicant Must Provide: 

An initial or substitute computer readable form (CRF) copy of the "Sequence Listing". 



An initial or substitute paper copy of the "Sequence Listing", as well as an amendment directing its 
entp/Tntb the specification. " — 



O A statement that the content of the paper and computer readable copies are the same and, where 
LO applicable, include no new matter, as required by 37 C.F.R. 1.821(e) or 1.821(f) or 1 821(a) or 
1.825(b) or 1.825(d). 

For questions regarding compliance to these requirements, please contact: 

For Rules Interpretation, call (703) 308-4216 
For CRF Submission Help, call (703) 308-4212 
For Patentln software help, call (703) 308-6856 

PLEASE RETURN A COPY OF THIS NOTICE WITH YOUR RESPONSE 



